Abstract

We examine the phenomenology of particle multiplicity distributions,
with special emphasis on the low multiplicities that are a background
to the study of rapidity gaps. In particular, we analyze the
multiplicity distribution in a rapidity interval between two jets, using
the HERWIG QCD simulation with some necessary modifications. The
distribution is not of the "negative binomial" form, and displays an
anomalous enhancement at zero multiplicity. Some useful mathematical
tools for working with multiplicity distributions are presented. It
is demonstrated that ignoring particles with $p_\perp < 0.2$ GeV/c has
theoretical advantages, in addition to being convenient experimentally.



1 Introduction 

The probabilities for various numbers of hadrons to be produced in a high
energy collision, in a fixed region of phase space that is usually defined by a
range of pseudo-rapidity, are known as the multiplicity distribution. Attempts
have been made to understand multiplicity distributions on the basis of
intuitive notions of branching and decay of "clusters" [1, ?]. Approaches with
an explicit basis in QCD have also been made for regimes where an underlying
hard scattering permits perturbative techniques [?]. QCD simulation
programs such as HERWIG [?] include elements of both of these approaches.

Previous work on the multiplicity distribution $\{P_n\}$ has centered on KNO
scaling and its violation [?], "intermittency" [?], and the factorial moments
$\langle n \rangle$, $\langle n(n-1) \rangle$, ... [?]. These related concepts
emphasize average and larger-than-average multiplicities, which reflect the
multiple soft jet production that is characteristic of QCD at high energy.
In this paper, we instead focus on the region of low multiplicity.

At the extreme low end of the multiplicity distribution, one encounters 
the physics of rapidity gaps, which can be defined as regions of length $\Delta y > 3$
in rapidity that contain no final-state particles. Rapidity gaps offer a unique 
insight into the workings of QCD. They can in principle be made by the 
exchange of a color-singlet object, such as an appropriate state of two or 
more gluons. They can also be considered — by definition — to be a facet of 
the mysterious pomeron that also governs elastic and diffractive scattering. 

A particularly interesting type of rapidity gap occurs when the gap lies
between two high-$p_\perp$ jets that are widely separated in rapidity and
approximately back-to-back in azimuthal angle [?, ?, ?]. In this paper, we study
the multiplicity distribution in a region between two such jets, resulting from
non-pomeron physics, since that is an unavoidable background to probing
rapidity gap physics.

The major detectors CDF and D0 at the Tevatron ($p\bar p$ at $\sqrt{s} = 1800$ GeV),
and ZEUS and H1 at HERA [?] ($e^- p$ at $\sqrt{s} = 300$ GeV), can be used to study
rapidity gaps experimentally. However, the range in pseudo-rapidity where
these detectors are most sensitive, reduced to leave room for jet evidence of
hard scattering, is not very large.^1 It is therefore important to estimate the
background from fluctuations in "normal" multiparticle production. This is
the motivation for our study of the multiplicity distribution at small $n$.

^1 The coverage in $\eta$ could in principle be extended using scintillation counters as "gap
detectors". A detector upgrade of this type should be relatively simple, since it is not
necessary to have fine segmentation to look for zero particles!

Quantitative results presented in this paper are based on the QCD Monte 
Carlo program HERWIG [?]. This program incorporates the color connections
between partons, and therefore includes the natural suppression of rapidity 
gaps that is present in QCD, apart from the possibility of coherent color- 
singlet exchange. It therefore provides a proper model for the background to 
rapidity gap physics. The simulation also models the production and decay of 
many of the known low-mass hadronic resonances. These create short-range 
rapidity correlations that strongly influence the multiplicity distribution in 
small intervals. The Monte Carlo also provides an opportunity to appraise
the standard practice of substituting the easily-measured pseudo-rapidity
variable $\eta = \log\cot(\theta/2) = \frac{1}{2}\log[(|p| + p_z)/(|p| - p_z)]$ for the
more natural true rapidity $y = \frac{1}{2}\log[(E + p_z)/(E - p_z)]$.

There is no guarantee, of course, that Monte Carlo predictions for the 
multiplicity are correct. But it is not unreasonable to expect that formulae 
that will be adequate to parametrize the eventual experiments should be 
at least flexible enough to fit the simulated data. When real data become 
available, one may hope to tune the Monte Carlo parameters to improve the 
accuracy of the simulation. 

Parametrizations based on the simulation may also be useful in correcting
actual data for losses due to incomplete acceptance. This is especially
important for the major detectors CDF and D0, which were not designed
to measure particles with transverse momenta below 0.2 GeV/c. On the
other hand, we will use the simulation to show in Sect. 4 that it may actually
be desirable to neglect particles with very low $p_\perp$, since that region is overly
sensitive to contamination by particles produced in the decays of resonances
that are far away in rapidity, and since particles other than photons at small
$|\eta|$ are kinematically suppressed there anyway.

An outline of the paper is as follows. Sect. 2 introduces some useful
mathematical tools for working with multiplicity distributions, many of which
have been suggested previously [?]. Sect. 3 describes results from a HERWIG
simulation that is loosely applicable to experiments in progress at CDF and
D0. Sect. 4 examines the region of very low $p_\perp$. Sect. 5 summarizes principal
conclusions.



2 Theoretical Tools 

2.1 Generating Function 

Our subject is $\{P_n\}$, the set of probabilities to observe $n$ particles in an event
in a selected region of phase space. The region is generally defined in terms
of pseudo-rapidity $\eta$, or in terms of the Lego variables $\eta$ and azimuthal angle
$\phi$. The particles are mainly $\pi^\pm$, and $\gamma$ from $\pi^0$ decay, with average transverse
momenta of a few hundred MeV/c.

The distribution is conveniently represented by the generating function 

$$ g(x) = \sum_{n=0}^{\infty} P_n\, x^n \tag{1} $$

which carries all of the information of $\{P_n\}$. The factorial moments are
related to the behavior of $g(x)$ in the limit $x \to 1$:

$$ g(1) = 1, \quad g'(1) = \langle n \rangle, \quad g''(1) = \langle n(n-1) \rangle, \quad g'''(1) = \langle n(n-1)(n-2) \rangle \tag{2} $$

Meanwhile, the low multiplicity region we are interested in is contained in
the behavior as $x \to 0$:

$$ g(0) = P_0, \quad g'(0) = P_1, \quad g''(0)/2! = P_2, \quad g'''(0)/3! = P_3 \tag{3} $$

In principle, $P_n$ is exactly zero beyond some large maximum $n$, because
the energy in the event is finite; so $g(x)$ is a high-order polynomial. In
practice, however, $P_n$ falls smoothly and rapidly (perhaps exponentially) at
large $n$, and becomes immeasurably small long before the maximum value
is approached. Hence it is appropriate to approximate $g(x)$ by an analytic
function, whose infinite series converges at least out to $|x| = 1$ in the complex
plane in view of the fact that $g(1) = 1$.

The analytic behavior of $g(x)$ can be useful. If one has an analytic
expression for $g$, from a model or simply a parametrization, a convenient
method to calculate the corresponding probabilities is to integrate $g(x)\, x^{-n-1}$
numerically around the unit circle in the complex plane and use Cauchy's
theorem to obtain $P_n = \frac{1}{2\pi i} \oint g(x)\, x^{-n-1}\, dx$.
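
The contour integration just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the number of sample points, and the choice of a Poisson generating function as the test case are ours, not part of the original analysis. On the unit circle $x = e^{i\theta}$ the Cauchy integral reduces to an average of $g(e^{i\theta})\, e^{-in\theta}$ over equally spaced angles.

```python
import cmath
import math


def probabilities_from_generating_function(g, n_max, n_points=512):
    """Return [P_0, ..., P_n_max] from P_n = (1/2 pi i) * contour integral of g(x) x^(-n-1).

    With x = exp(i*theta), dx = i*x*dtheta, so the integral becomes the mean of
    g(exp(i*theta)) * exp(-i*n*theta) over the circle (trapezoid rule).
    """
    samples = [g(cmath.exp(2j * math.pi * m / n_points)) for m in range(n_points)]
    probs = []
    for n in range(n_max + 1):
        total = sum(
            s * cmath.exp(-2j * math.pi * m * n / n_points)
            for m, s in enumerate(samples)
        )
        probs.append((total / n_points).real)
    return probs


def poisson_generating_function(mean):
    """Illustrative test case: g(x) = exp(-(1 - x) * mean) for a Poisson distribution."""
    return lambda x: cmath.exp(-(1.0 - x) * mean)


if __name__ == "__main__":
    for n, p in enumerate(probabilities_from_generating_function(
            poisson_generating_function(3.0), 5)):
        print(n, p)
```

The number of sample points must be comfortably larger than the highest multiplicity with non-negligible probability, since the discrete sum cannot distinguish $P_n$ from $P_{n + M}$ for $M$ sample points.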

2.1.1 Cluster Decay Theorem 

The generating function is a convenient tool for analyzing models in which 
"clusters" decay independently to make the observed hadrons. The clusters 
can be low-mass objects such as those assumed in QCD Monte Carlo simulations
at a low $Q^2$ non-perturbative scale, or the hypothetical objects in
branching models, or any of the large number of hadronic resonances that 
are the immediate ancestors of most observed hadrons. 

The connection is as follows: if $P^{(1)}_n$ is the probability to produce $n$
clusters and $P^{(2)}_n$ is the probability for a cluster to decay into $n$ particles, then
the overall distribution of particles $\{P_n\}$ is given by the generating function
relation

$$ g(x) = g^{(1)}(g^{(2)}(x)) , \tag{4} $$

assuming that the clusters decay independently. Proof of this relation follows
from the obvious expression:

$$ P_n = \sum_{j=0}^{\infty} P^{(1)}_j \sum_{n_1=0}^{\infty} P^{(2)}_{n_1} \cdots \sum_{n_j=0}^{\infty} P^{(2)}_{n_j}\, \delta_{n_1 + \cdots + n_j,\, n} . \tag{5} $$

The proof is easily generalized to show $g(x) = g^{(1)}(g^{(2)}(g^{(3)}(x)))$ for
independent decay of independently-decaying clusters, etc.

A simple but useful special case of this theorem applies to detection
efficiencies. If $Q < 1$ is the detection probability ("efficiency") for a
single particle, one can think of the particle as a 'cluster' with $P_1 = Q$ and
$P_0 = 1 - Q$. The effect of the inefficiency can therefore be expressed by
$g(x) \to g(1 - Q(1 - x))$.
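
As a rough numerical check of this efficiency rule, the sketch below thins a multiplicity distribution particle by particle (binomial selection with probability $Q$) and lets one compare the generating function of the result with $g(1 - Q(1 - x))$. The function names and the tabulated-list representation of $\{P_n\}$ are illustrative assumptions.

```python
import math


def apply_efficiency(probs, q):
    """Return the observed distribution when each particle is detected with probability q.

    probs[n] is the true probability of n particles; each of the n particles is
    kept independently, so m survivors occur with a binomial weight.
    """
    observed = [0.0] * len(probs)
    for n, p_n in enumerate(probs):
        for m in range(n + 1):
            observed[m] += p_n * math.comb(n, m) * q**m * (1.0 - q) ** (n - m)
    return observed


def generating_function(probs, x):
    """Evaluate g(x) = sum_n P_n x^n for a tabulated distribution."""
    return sum(p * x**n for n, p in enumerate(probs))


if __name__ == "__main__":
    true_probs = [0.2, 0.5, 0.3]          # arbitrary illustrative distribution
    q, x = 0.8, 0.4
    seen = apply_efficiency(true_probs, q)
    print(generating_function(seen, x))
    print(generating_function(true_probs, 1.0 - q * (1.0 - x)))
```

The two printed numbers agree because summing the binomial weights over $m$ turns $x^n$ into $(1 - Q + Qx)^n$, which is the substitution stated in the text.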



2.1.2 Independent Sources Theorem 

The generating function is also a convenient tool for analyzing models in 
which the observed hadrons come from two or more statistically independent 
sources. An important example that we will use in Sect. 3.2 occurs in
simulation programs (and perhaps also in nature), where in addition to particles
resulting from a QCD hard scattering and its associated radiation, there are
particles in the final state known as the "soft background event", from soft
interactions between the other partons in the initial composite hadrons. This
possibility of background particles leads to the notion of a survival
probability for rapidity gaps [?, 13]. Another example that is important for us is
that of particles that appear far outside the cone of a jet, as a result of
sequential decays of hadronic resonances produced inside the jet cone.

The relevant theorem is as follows: if two independent sources have
probability distributions $\{P^{(1)}_n\}$ and $\{P^{(2)}_n\}$, then both together result in

$$ g(x) = g^{(1)}(x) \times g^{(2)}(x) \tag{6} $$

Proof follows directly from

$$ P_n = \sum_{n_1=0}^{\infty} P^{(1)}_{n_1} \sum_{n_2=0}^{\infty} P^{(2)}_{n_2}\, \delta_{n_1 + n_2,\, n} . \tag{7} $$

For the lowest multiplicities, Eq. (7) takes the obvious forms

$$ P_0 = P^{(1)}_0 P^{(2)}_0 \tag{8} $$

$$ P_1 = P^{(1)}_0 P^{(2)}_1 + P^{(1)}_1 P^{(2)}_0 . \tag{9} $$

The theorem can be generalized to

$$ \log g(x) = \log g^{(1)}(x) + \cdots + \log g^{(N)}(x) \tag{10} $$

for combining $N$ independent sources. Thus on a logarithmic scale,
generating functions from independent sources are additive.
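
In terms of tabulated probabilities, the independent sources theorem is a discrete convolution. The following sketch, with illustrative function names and made-up input distributions, combines two sources and makes it easy to confirm that the lowest multiplicities and the product rule for generating functions come out as stated.

```python
def combine_sources(p1, p2):
    """Convolve two independent multiplicity distributions given as lists P_0, P_1, ..."""
    combined = [0.0] * (len(p1) + len(p2) - 1)
    for n1, a in enumerate(p1):
        for n2, b in enumerate(p2):
            combined[n1 + n2] += a * b   # the Kronecker delta selects n = n1 + n2
    return combined


def generating_function(probs, x):
    """Evaluate g(x) = sum_n P_n x^n for a tabulated distribution."""
    return sum(p * x**n for n, p in enumerate(probs))


if __name__ == "__main__":
    source_1 = [0.5, 0.3, 0.2]   # illustrative numbers only
    source_2 = [0.1, 0.6, 0.3]
    total = combine_sources(source_1, source_2)
    print(total[0], source_1[0] * source_2[0])
    print(generating_function(total, 0.7),
          generating_function(source_1, 0.7) * generating_function(source_2, 0.7))
```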

2.2 Density Function 

Intuitively, we want to make a smooth parametrization of the multiplicity
distribution for $n > 0$, and extrapolate it to $n = 0$ to see if there is an anomalous
contribution that would signal rapidity gap physics. The parametrization is
not a trivial matter, because $P_n$ varies rapidly with $n$ at small $n$, especially
for large rapidity intervals where $\langle n \rangle$ is large.

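A representation of the probability distribution that I find to be useful
describes it as a continuous superposition of Poissons:

$$ P_n = \int_0^{\infty} dz\, \rho(z)\, e^{-z} z^n / n! . \tag{11} $$

The density function $\rho(z)$ is the relative probability to have a Poisson process
of average multiplicity $z$. Mathematically, the generating function evaluated
at $1 - x$ is the Laplace transform of $\rho(z)$, so $\rho(z)$ is obtained from the
generating function by inverting that transform:

$$ g(1 - x) = \int_0^{\infty} dz\, \rho(z)\, e^{-xz} . \tag{12} $$

The moments of the continuous distribution $\rho(z)$ are the factorial moments:
$\int_0^{\infty} \rho(z)\, dz = 1$, $\int_0^{\infty} \rho(z)\, z\, dz = \langle n \rangle$, and in general

$$ \int_0^{\infty} \rho(z)\, z^j\, dz = \langle n(n-1) \cdots (n-j+1) \rangle . \tag{13} $$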





The independent sources theorem, Eq. (6) of Sect. 2.1.2, can be expressed
in terms of density functions in the form of a convolution integral

$$ \rho(z) = \int_0^{z} \rho^{(1)}(z_1)\, \rho^{(2)}(z - z_1)\, dz_1 . \tag{14} $$

The density function would not of course have to be positive definite; but
it turns out to be so for all distributions discussed in this paper. Smoothness
of $\rho(z)$ is a good way to express the physical notion that $P_n$ should be a
smooth function of $n$, with the possible exception of structure at or near
$n = 0$ from the rapidity gap physics we wish to study. The behavior of $P_n$ at
small $n$ is governed mainly by $\rho(z)$ at small $z$. In the extreme, a term $\propto \delta(z)$
would contribute to $P_0$ only.

A convenient way to determine $\rho(z)$ from data in an experiment or
simulation is to fit the data to a parametrization whose transform is known. We
will do this using a sum of terms of the form $z^{k-1} e^{-bz}$, which correspond to
the NBD discussed in Sect. 2.3. From the simulation, we will find empirically
that terms with $k > 1$ describe most of the distribution, so $\rho(z) \to 0$ like
a power as $z \to 0$. To allow for the possibility that $\rho(0) \neq 0$, one can also
include a term of the form

$$ (1 + bz)\, e^{-bz} \tag{15} $$

which has $\rho'(0) = 0$.

2.3 Negative Binomial Distribution 

The Negative Binomial Distribution (NBD) is a popular phenomenological
form for multiplicity distributions. It is defined by

$$ P_n = \binom{n + k - 1}{k - 1} \left( \frac{k}{k + \bar n} \right)^{k} \left( \frac{\bar n}{k + \bar n} \right)^{n} \tag{16} $$

where

$$ \binom{n + k - 1}{k - 1} = \frac{k(k+1) \cdots (k + n - 1)}{n!} . \tag{17} $$

It can be conveniently computed with the recurrence relation

$$ P_0 = (1 + \bar n / k)^{-k} \tag{18} $$

$$ P_{n+1} = \left( \frac{n + k}{n + 1} \right) \left( \frac{\bar n}{\bar n + k} \right) P_n . \tag{19} $$

Its factorial moments are given by $\langle n \rangle = \bar n$ and in general

$$ \langle n(n-1) \cdots (n-j+1) \rangle = k(k+1) \cdots (k+j-1) \left( \frac{\bar n}{k} \right)^{j} . \tag{20} $$

Its generating function is

$$ g(x) = [1 + (1 - x)\, \bar n / k]^{-k} . \tag{21} $$

Eq. (21) implies

$$ g''\, g / (g')^2 = 1 + 1/k , \tag{22} $$

which could be used to test whether a distribution is of the NBD form. A
related test would be to see if $g/g'$ is a linear function of $x$: $g(x)/g'(x) =
1/\bar n + (1 - x)/k$.
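
A minimal sketch of the recurrence, assuming the standard NBD form with mean $\bar n$ and shape parameter $k$: start from $P_0 = (1 + \bar n/k)^{-k}$ and multiply by $(n + k)/(n + 1) \times \bar n/(\bar n + k)$ at each step. The second function applies the $g'' g/(g')^2$ test at $x = 0$, where the derivatives reduce to $P_0$, $P_1$ and $2 P_2$. Function names are illustrative, and the numerical values in the usage lines are the single-NBD parameters quoted later for Fig. 1.

```python
def nbd_probabilities(mean, k, n_max):
    """Return [P_0, ..., P_n_max] of a negative binomial distribution via the recurrence."""
    probs = [(1.0 + mean / k) ** (-k)]
    ratio = mean / (mean + k)
    for n in range(n_max):
        probs.append(probs[-1] * (n + k) / (n + 1) * ratio)
    return probs


def nbd_shape_test_at_zero(probs):
    """Evaluate g''(0) g(0) / g'(0)^2 = 2 P_2 P_0 / P_1^2, which equals 1 + 1/k for an NBD."""
    return 2.0 * probs[2] * probs[0] / probs[1] ** 2


if __name__ == "__main__":
    probs = nbd_probabilities(14.21, 3.78, 400)
    print(sum(probs))                                  # normalization check
    print(sum(n * p for n, p in enumerate(probs)))     # mean
    print(nbd_shape_test_at_zero(probs), 1.0 + 1.0 / 3.78)
```

Applying the same ratio to a measured or simulated $\{P_n\}$ gives a quick, if statistically noisy, indication of whether a single NBD is plausible at low multiplicity.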
The density function for the NBD is

$$ \rho(z) = \frac{b^k}{\Gamma(k)}\, z^{k-1} e^{-bz} \tag{23} $$

where $b = k/\bar n$. It has a single peak at $z = \bar n (1 - 1/k)$ if $k > 1$, or peaks at
$z = 0$ for $0 < k < 1$. As described in Sect. 2.2, a convenient way to determine
$\rho(z)$ from experiment or simulation is to fit $\{P_n\}$ to a superposition of NBD
terms and then use Eq. (23) to get $\rho(z)$.
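
The link between the density function and the NBD can be checked numerically. The sketch below assumes the gamma-shaped density $\rho(z) = b^k z^{k-1} e^{-bz}/\Gamma(k)$ with $b = k/\bar n$, integrates the Poisson superposition with a simple midpoint rule, and compares the result with the closed-form NBD probability. The integration range, step count, and function names are illustrative choices, and the midpoint rule is adequate only for $k > 1$, where the integrand is finite at $z = 0$.

```python
import math


def nbd_density(z, mean, k):
    """Gamma-shaped density rho(z) = b^k z^(k-1) exp(-b z) / Gamma(k), with b = k / mean."""
    b = k / mean
    return math.exp(k * math.log(b) - math.lgamma(k) + (k - 1.0) * math.log(z) - b * z)


def poisson_superposition(density, n, z_max=200.0, steps=20000):
    """Midpoint-rule estimate of P_n = integral of rho(z) exp(-z) z^n / n! over z."""
    dz = z_max / steps
    total = 0.0
    for i in range(steps):
        z = (i + 0.5) * dz
        total += density(z) * math.exp(-z + n * math.log(z) - math.lgamma(n + 1))
    return total * dz


def nbd_probability(n, mean, k):
    """Closed-form NBD probability, written with log-gamma functions for stability."""
    log_p = (math.lgamma(n + k) - math.lgamma(k) - math.lgamma(n + 1)
             + k * math.log(k / (k + mean)) + n * math.log(mean / (k + mean)))
    return math.exp(log_p)


if __name__ == "__main__":
    mean, k = 14.21, 3.78
    for n in (0, 5, 20):
        numeric = poisson_superposition(lambda z: nbd_density(z, mean, k), n)
        print(n, numeric, nbd_probability(n, mean, k))
```

A sum of several such terms with different $(\bar n, k)$ and weights gives the multi-NBD parametrization, since both the integral and the density are linear in the weights.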

In the limit $k \to \infty$, the NBD reduces to a Poisson distribution
corresponding to uncorrelated production: $P_n = (\bar n^n / n!)\, e^{-\bar n}$, $g(x) = e^{-(1-x)\bar n}$,
$\rho(z) = \delta(z - \bar n)$.

The multiplicity distributions we are interested in display a single
maximum with various degrees of broadness, and fall rapidly at large $n$. The
two free parameters of the NBD suffice to fit the first two moments $\langle n \rangle = \bar n$
and $\langle n^2 \rangle = \bar n^2 (1 + 1/\bar n + 1/k)$, and hence the NBD can provide at least a
qualitative description of the probabilities $P_n$ where they are large. We will
see, however, that a single NBD does not fit our distributions in detail.
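
Matching the first two moments amounts to solving $\langle n^2 \rangle = \bar n^2 (1 + 1/\bar n + 1/k)$ for $k$. A small sketch, with an illustrative function name and an explicit error for distributions that are not broader than Poisson:

```python
def nbd_parameters_from_moments(mean, mean_square):
    """Return (nbar, k) of the NBD that reproduces <n> and <n^2>.

    Inverts <n^2> = nbar^2 (1 + 1/nbar + 1/k); requires a distribution
    broader than Poisson, i.e. <n^2> > nbar^2 + nbar.
    """
    inverse_k = mean_square / mean**2 - 1.0 - 1.0 / mean
    if inverse_k <= 0.0:
        raise ValueError("distribution is not broader than Poisson; no finite positive k")
    return mean, 1.0 / inverse_k


if __name__ == "__main__":
    # Round trip with the single-NBD parameters quoted for Fig. 1.
    nbar, k = 14.21, 3.78
    second_moment = nbar**2 * (1.0 + 1.0 / nbar + 1.0 / k)
    print(nbd_parameters_from_moments(nbar, second_moment))
```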

Experimentally, the main published data on multiplicity distributions in
very high energy hadron-hadron collisions are those of UA5 for charged
particles in minimum bias non-diffractive events at $\sqrt{s} = 900$ GeV [?]. The
data for rapidity intervals $\Delta\eta < 2$ are well described by NBD distributions.
For $2 < \Delta\eta < 5$, the data are close to NBD, although the NBD fits are not
statistically acceptable. However, these data come from only a few thousand
events, and therefore have rather large statistical errors where $P_n$ is small.
They also have large systematic errors at low multiplicity, where efficiencies
are hard to determine. Hence the NBD form might not be magic. NBD
distributions have also been seen in other data, including $e^+ e^- \to$ hadrons
[?, ?] and nucleus-nucleus with low statistics [16].

A systematic study of multiplicity distributions in minimum bias and/or
various hard-scattering processes at the Tevatron has yet to be carried through,
although preliminary results from CDF have been presented [?]. Some
useful information has been obtained by E735 [?]. It would seem that D0
could directly extend the measurements of UA5 to $\sqrt{s} = 1800$ GeV, since
their lack of a magnetic field simplifies the tracking of charged particles at
low momentum.



3 Monte Carlo Simulation 

3.1 Hard Scattering 

The QCD Monte Carlo program HERWIG 5.7 [?] was used to simulate $p\bar p$
scattering at the Fermilab Tevatron energy $\sqrt{s} = 1800$ GeV, for final states
that contain two relatively high $p_\perp$ jets separated widely in rapidity. We
will examine the multiplicity distribution in the interval between these two
"trigger jets".

Specifically, we require two jets with $p_\perp^{(1)}, p_\perp^{(2)} > 30$ GeV/c, $-3.5 < \eta_1 <
-1.5$, $1.5 < \eta_2 < 3.5$, and $|\eta_2 - \eta_1| > 4$. We require there to be no additional
jet with $p_\perp > 30$ GeV/c elsewhere in the event. The jets are defined by a
cone algorithm that I have used previously [19], with a cone radius of 0.7
in Lego. This configuration is interesting for gap physics. It is also a good
one to study from an experimental standpoint, because the region between
the jets, in which the multiplicity is to be measured, is in the best region
of the detectors. Indeed D0 has already published data for a rather similar
configuration [20], and further data from both D0 [?] and CDF [?] will be
forthcoming.

In leading-order QCD, the exchange of transverse momentum between 
the partons that produce the trigger jets is accompanied by an exchange of 
color. As a result, one expects lots of gluon radiation, and hence average 
multiplicities greater than those seen in minimum bias events, in the interval 
between the jets. However, if color-singlet exchanges are significant, e.g., in 
the form of gluon ladders, one can also expect to observe some events with 
rapidity gaps [?, ?, ?].



HERWIG includes all possible QCD $2 \to 2$ tree diagrams for the hard
scattering. Among these diagrams, gluon exchange dominates over quark
exchange because of the large rapidity separation and the gluon's higher spin.
Also, the scattering partons are mainly $q$ and $\bar q$ because the large sub-energy

$$ \hat s = 2 p_\perp^{(1)} p_\perp^{(2)} \left[ \cosh(y^{(1)} - y^{(2)}) - \cos(\phi^{(1)} - \phi^{(2)}) \right] \approx p_\perp^{(1)} p_\perp^{(2)}\, e^{|y^{(1)} - y^{(2)}|} \tag{24} $$

requires them to have large momentum fractions $x_1$ and $x_2$, which are
suppressed more strongly for gluons by the parton distribution functions.
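
To get a feeling for the size of the sub-energy, the following sketch evaluates the two-parton invariant mass squared for massless partons at the minimum trigger configuration described above (both jets at 30 GeV/c, separated by 4 units of rapidity, back-to-back in azimuth). The function name is illustrative, and identifying the jet kinematics with the parton kinematics is a simplifying assumption.

```python
import math


def subenergy_squared(pt1, pt2, y1, y2, phi1, phi2):
    """Invariant mass squared of two massless partons from their pT, rapidity and azimuth."""
    return 2.0 * pt1 * pt2 * (math.cosh(y1 - y2) - math.cos(phi1 - phi2))


if __name__ == "__main__":
    s_hat = subenergy_squared(30.0, 30.0, -2.0, 2.0, 0.0, math.pi)   # GeV^2
    sqrt_s = 1800.0                                                   # GeV, Tevatron
    print("sqrt(s_hat) =", math.sqrt(s_hat), "GeV")
    print("x1 * x2     =", s_hat / sqrt_s**2)
```

With a symmetric configuration, $x_1 = x_2 = \sqrt{\hat s / s}$, which for these inputs is of order 0.1, in the region where quark and antiquark densities in the proton and antiproton fall less steeply than the gluon density.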

HERWIG is appropriate for this simulation because it correctly includes the
color structure of the QCD hard scattering. It also includes the production of
many of the actual low-mass hadronic resonances, which have an important
influence on the multiplicity distribution. A final important feature is that
the program contains no color-singlet exchange, or pomeron physics in any
other form, so it provides a clean model for the background to rapidity gaps.
The most recent version 5.7 of HERWIG [?] was used, with its default
parameter values except for PTMIN = 30 GeV/c to suit our desired jet $p_\perp$'s, and
PRSOF = 0, which will be discussed in Sect. 3.2. It was necessary to modify the
off-the-shelf program to remove unphysical behavior that otherwise appears
for our rare final state, as follows. The underlying $2 \to 2$ cross section in
HERWIG is evaluated for on-mass-shell partons. But the partons are actually
off shell as a result of the initial state radiation branchings that are a
principal feature of the program. To enforce a reasonable consistency, we reject
events for which the squared four-momentum of either initial parton is larger
in magnitude than

$$ Q^2 = 2 \hat s \hat t \hat u / (\hat s^2 + \hat t^2 + \hat u^2) , \tag{25} $$

a symmetric measure of the hardness of the scattering.^2 Events in which
either observed jet axis differs by more than 1.0 in rapidity from the scattered
parton (ihep = 7, 8) responsible for it are also rejected. This cut removes
only 10% of the events. It guarantees that the trigger jets, which are the two
largest $p_\perp$ jets in the event, come from the underlying hard scattering, as they
should to be consistent with the approximations on which the simulation is
based.

The HERWIG program was modified to improve its efficiency for gener- 
ating events that satisfy our cuts, with no further change in content, by 
replacing its uniformly random generation of the two final rapidities in the 
$2 \to 2$ subprocess with an appropriately peaked one. This of course required
the event weighting to be handled by the user's program. 

Fig. 1 shows the multiplicity distribution, based on 130,000 Monte Carlo
events, for particles with $p_\perp > 0.2$ GeV/c in a rapidity interval of length
2.5 centered between the two jets in each event. The center of the interval,
$(\eta^{(1)} + \eta^{(2)})/2$, is distributed with a mean of 0 and a standard deviation of
0.36. The nominal jet cones lie entirely outside the interval, since the jet
axes are at least $4.0 = 2.5 + 2 \times 0.75$ units apart.

^2 This modification is necessary to avoid unphysical behavior in HERWIG 5.7, even
though a bug corrected in that version improved the situation as compared to version 5.6.

The dashed curve in Fig. 1 shows an attempt to fit the distribution with
a negative binomial form. Although it has the correct qualitative behavior,
the single NBD does not accurately represent $\{P_n\}$.^3 The parameters of the
fit shown ($\bar n = 14.21$, $k = 3.78$) were chosen to match $\langle n \rangle$ and $\langle n^2 \rangle$, but
other choices do not work much better. The NBD fit is particularly poor in
the region of small $n$ that is our major interest.

The solid curve in Fig. 1 is a good fit to $\{P_n\}$ for all $n \neq 0$. This fit has
a good $\chi^2$, and continues to fit at larger values of $n$ (not shown), all the way
out to $n \sim 60$, beyond which statistical errors become overwhelming. The fit
consists of a sum of two NBD terms. However, the density function
representation $\rho(z)$ in Fig. 2 (solid curve) shows that these two terms do not describe
distinct peaks, but rather overlap to form a single very smooth distribution.
The distribution is qualitatively similar to, but somewhat narrower than, the
single NBD approximation (dashed curve).

Although the solid curve in Fig. 1 is a smooth fit to the $n \neq 0$ data,
its extrapolation to $n = 0$ is 0.0018, which underestimates the actual $P_0 =
0.0035$ by a factor of 1.9. This raises a warning flag for rapidity gap searches,
where the signal would correspond to just such an "extra" probability for
$n = 0$! A similar but even stronger effect occurs if all particles are included
instead of just those with $p_\perp > 0.2$: the actual value is $P_0 = 0.0014$ while
the extrapolation gives 0.0003. Similar behavior also occurs if the interval
is defined using true rapidity $\Delta y = 2.5$ in place of $\Delta\eta = 2.5$: $P_0 = 0.0011$,
fit = 0.0002; or for a larger interval such as $\Delta\eta = 3.0$: $P_0 = 0.0011$, fit =
0.0002. The effect is present but somewhat smaller if only charged particles
with $p_\perp > 0.2$ are counted: $P_0 = 0.0093$, fit = 0.0064, or if charged particles
with all $p_\perp$ are counted: $P_0 = 0.0052$, fit = 0.0039. The effect remains if all
hadron resonances are made stable instead of being allowed to decay: in the
interval $\Delta y = 2.5$ we have $P_0 = 0.0033$, fit = 0.0014, or in the longer interval
$\Delta y = 3.0$ we have $P_0 = 0.0016$, fit = 0.0005.

These results indicate that in order to establish a rapidity gap signal ex- 
perimentally, the signal will have to be large compared to the background 
estimated by extrapolation from larger n, since the extrapolation can under- 
estimate the non-pomeron contribution. 

The dotted curve in Fig. 2 shows the density function for a parametrization
that fits $\{P_n\}$ in Fig. 1 for all $n$. The parametrization contains a term
of the form Eq. (15) that allows the extra probability for $P_0$ to appear as
structure in $\rho(z)$ at very small $z$ with $\rho(0) \neq 0$.

^3 This result is not inconsistent with a previous claim [23] that Monte Carlo
simulations are NBD. That claim is based on only 2000 events, implying large statistical errors
wherever $P_n$ is small; and even with the large statistical errors, many of the fits described
are inadequate in the sense of $\chi^2$.

3.2 Background Event 

The colliding $p$ and $\bar p$ hadrons are extended objects containing many partons.
Events of the type we are interested in, where two partons interact to produce 
jets, will generally occur only in collisions for which the impact parameter 
is small. There are likely to be additional soft interactions between other 
constituents, of the same character as those of the typical "minimum-bias" 
interactions that account for the fact that the inelastic interaction probability 
is nearly 1.0 at small impact parameter. These additional interactions lead 
to the production of particles known as the "soft background event", which
raise $\langle n \rangle$ and decrease $P_0$.

It is reasonable to assume that the soft background particles are
statistically independent from the particles we have considered so far. This allows
us to compute the final $\{P_n\}$ by combining fits to its hard and soft
components, using the $g(x) = g^{(1)}(x) \times g^{(2)}(x)$ theorem (Eqs. (6)-(10)) or a simple
Monte Carlo. This is a significant technical help to the calculation, since $P_0$
is so small that it would otherwise require a very large number of events from the
QCD simulation to determine it accurately.

The HERWIG package contains a model for the soft background event,
which was turned off by setting the parameter PRSOF = 0 to obtain the hard
scattering results discussed in Sect. 3.1 above. Turning it on in every event
via PRSOF = 1 leads to soft background particle distributions that are well
described (at 40,000 event statistics) by single NBD distributions, except for
a sizable extra contribution to $P_0$. The HERWIG model for the background
event is based on the UA5 data, so it is perhaps not surprising that it has
an NBD form, although this result is not obvious, since the model actually
assumes an NBD form for clusters rather than for final particles. I have
checked that, at any rate, the model predicts charged-particle multiplicity
distributions consistent with those observed by UA5. The distributions are
rather broad in the sense that the NBD parameter $k$ is small. For example,
for particles in the region $\Delta\eta = 2.5$, $p_\perp > 0.2$ corresponding to Figs. 1-2,
the background NBD parameters are $k = 1.8$ and $\bar n = 15.2$, with an
extra contribution of 0.03 to $P_0$. The origin and/or validity of the "extra"
contribution, which makes $P_0$ larger than any other single $P_n$, is unclear; so
I have tried computing the final $\{P_n\}$ both with it and without it.

Including the background event, with the extra contribution to $P_0$ "on",
changes the multiplicity distribution of Fig. 1 to that shown in Fig. 3a. An
expanded view showing the details at small $n$ is shown in Fig. 3b. The mean
$\langle n \rangle$ has become larger, since it is equal to the sum of the means from the
hard scattering and the background; while the probabilities at small $n$ have
become much smaller since, e.g., to get the extreme case $n = 0$ there must
be no particles in the interval from the hard scattering and also none from
the soft background event (Eq. (8)). The important qualitative conclusions
of Sect. 3.1 remain true with the background event included, however:



• $\{P_n\}$ is quite similar to a single NBD form (dashed curve, based on
$\bar n = 28.88$, $k = 4.54$ which fit $\langle n \rangle$ and $\langle n^2 \rangle$); but the single NBD does
not provide a fully acceptable fit, and is particularly unsuitable for
describing the low multiplicities.

• $\{P_n\}$ can be fit very well for all but the lowest values of $n$ by a sum of
two NBD terms (solid curve).

• The accurate two-NBD fit corresponds to a single smooth peak in the
Poisson density function $\rho(z)$ (Fig. 4), which is slightly narrower than
the single NBD approximation.

• $P_0$ is larger than the fit at $n = 0$ by roughly a factor of 2.

Fig. 3 is based on 1,000,000 events. This large number of events was used 
to obtain sufficient statistics to show the behavior at small n clearly, in view 
of the rather small values of $P_n$ there. However, the inadequacy of a single
NBD fit already sets in for $\gtrsim 50{,}000$ events.

The data in Fig. 3b can be fit by a single NBD term, with the
normalization treated as a free parameter, over the small region $1 < n < 12$. This
provides an alternative way to extrapolate to $n = 0$. It is consistent with the
result of using the two-NBD fit to the entire $n > 0$ distribution.

The entire distribution in Fig. 3, including $n = 0$, can be fit by a sum
of two NBD terms plus a term of the form Eq. (15) that allows $\rho(0) \neq 0$.
The size of this term is in fact so small that its effect would not be visible
in Fig. 4. Philosophically, however, one does not expect contributions with
finite probability density at average particle number zero, apart from true
rapidity gap processes.

If, instead of counting particles, one counts cells ("towers") of size $0.1 \times 0.1$
in $(\eta, \phi)$ space that have $E_\perp > 0.2$, as is done in calorimeter detectors, there
is essentially no change in the above results, since these cells are so small
that it is rare for more than one particle to enter a given cell, even at the
higher values of $n$.

An attempt was made to estimate the effect on the multiplicity
distribution of the geometric acceptance and detector efficiency in the current D0
experiment [?]. The assumed geometric acceptance was 1.0 (perfect) for
$|\eta| < 1.1$, with a "hole" corresponding to the edge of the central calorimeter
(0.0 for $|\eta| = 1.2 - 1.4$, 0.5 for $|\eta| = 1.1 - 1.2$ and $|\eta| = 1.4 - 1.5$), and a
linear fall-off from 0.7 at $|\eta| = 1.5$ to 0.1 at $|\eta| = 3.2$. The assumed efficiency
for photons rises steeply from 0 at $p = 0.2$ GeV/c to 0.94 by $p = 1$ GeV/c.
The assumed efficiency for charged particles rises more slowly, reaching 0.54
at $p = 1$ GeV/c and 0.80 at $p = 3$ GeV/c. For intervals of length $\Delta\eta = 3.0$
between the jets, with soft background particles included, the multiplicity
distribution is close to a single NBD with $\bar n = 14.0$ and $k = 4.4$. Deviations
from the single NBD fit begin to appear clearly when the number of events
is $\gtrsim 50{,}000$. A sum of two NBD terms fits the distribution even for ~ 10^
events, with very little anomalous contribution to $P_0 = 0.0014$.

4 Edge Effects and Transverse Momentum 

Figs. 1-4 are based on counting particles with $p_\perp > 0.2$ GeV/c. One reason
to require a minimum $p_\perp$ is to mimic typical experimental acceptance. But
by using the QCD simulation to study the types and origins of particles that
contribute to the multiplicity distribution, we will see that the $p_\perp$ cut also
provides some theoretical benefits.

Let us focus in particular on the interval of length $\Delta\eta = 2.5$ centered at
$\eta = 0 \pm 0.36$ considered in Sect. 3.1. The composition of particles is 51%
$\gamma$, 39% $\pi^\pm$, 6% $K^\pm, K^0_L$, and 3% $p, \bar p, n, \bar n$. The flood of photons is caused
by the fact that true rapidity is the proper measure of longitudinal phase
space. The Lorentz frame-specific pseudo-rapidity $\eta$ and the true rapidity $y$
are related by

$$ \sinh\eta = p_z / p_\perp \tag{26} $$

$$ \sinh y = p_z / \sqrt{p_\perp^2 + m^2} . \tag{27} $$

These become equal for $|p_\perp| \gg m$, as is always the case for the massless
photon; while particles whose transverse momentum is small compared to
their masses are swept out to $|\eta| > |y|$.^4 Imposing the cut $p_\perp > 0.2$ GeV/c
eliminates much of this pseudo-rapidity effect, and changes the composition
to 34% $\gamma$, 52% $\pi^\pm$, 9% $K^\pm, K^0_L$, and 5% $p, \bar p, n, \bar n$.
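
The size of the pseudo-rapidity shift is easy to evaluate from the two relations $\sinh\eta = p_z/p_\perp$ and $\sinh y = p_z/\sqrt{p_\perp^2 + m^2}$. The sketch below converts a true rapidity into a pseudo-rapidity for a given transverse momentum and mass; the function name and the sample kinematic values (a charged pion at $y = 1$) are illustrative assumptions.

```python
import math


def pseudorapidity_from_rapidity(y, pt, mass):
    """Return eta for a particle of true rapidity y, transverse momentum pt and mass.

    Uses sinh(y) = pz / sqrt(pt^2 + m^2) to get pz, then sinh(eta) = pz / pt.
    """
    pz = math.sinh(y) * math.sqrt(pt**2 + mass**2)
    return math.asinh(pz / pt)


if __name__ == "__main__":
    pion_mass = 0.1396   # GeV/c^2, charged pion
    for pt in (0.05, 0.2, 1.0):
        eta = pseudorapidity_from_rapidity(1.0, pt, pion_mass)
        print("pt =", pt, "GeV/c: eta - y =", eta - 1.0)
    # A massless photon is not shifted at all.
    print(pseudorapidity_from_rapidity(1.0, 0.05, 0.0) - 1.0)
```

The shift is largest for the slowest massive particles, which is why a $p_\perp$ cut removes most of the mismatch between the pseudo-rapidity interval and the corresponding true-rapidity interval.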

A benefit of imposing the minimum $p_\perp$ cut, in addition to experimental
convenience, is that it tends to eliminate particles whose pseudo-rapidities
are unrepresentative of the actual underlying physics. For example, low $p_\perp$
photons generally come from the decay of a $\pi^0$ whose true rapidity differs
from that of the photon by something on the order of 0.5 units. Furthermore,
the $\pi^0$ may come from the decay of a $\rho^\pm$ or other low-mass resonance that
is still further away; and that resonance may itself be a resonance decay
product. From the standpoint of rapidity gap physics, one would like to
think of $\pi$, $\rho$, or higher-mass resonances alike as stable. Decay effects are
simply a source of "noise" in the measurement of the rapidity of produced
hadrons. This noise is significant because the intervals we are looking at are
not very many units long in rapidity.

The HERWIG simulation leads to the following quantitative results for
the $\Delta\eta = 2.5$ interval. If all particles are included, 16% come from parent
particles or resonances produced outside the corresponding interval $\Delta y = 2.5$
of true rapidity. This includes 5% from parents that are more than 0.5 units
outside the interval. Imposing the cut $p_\perp > 0.2$ GeV/c improves the situation
considerably: only 9% come from parents outside the true interval $\Delta y = 2.5$,
with only 1% more than 0.5 units outside it.
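
The bookkeeping behind fractions of this kind can be sketched as follows. This is only an illustration: the record layout (pairs of particle pseudo-rapidity and parent true rapidity), the function name, and the toy numbers are hypothetical and are not HERWIG output.

```python
def parent_leakage(particles, center, width, margin=0.5):
    """Fractions of counted particles whose parents lie outside the interval.

    particles: iterable of (eta, y_parent) pairs (hypothetical layout).
    A particle is counted if its pseudo-rapidity eta lies in the window;
    its parent is then tested against the same window in true rapidity.
    Returns (fraction outside, fraction more than `margin` units outside).
    """
    lo, hi = center - width / 2.0, center + width / 2.0
    counted = [y_parent for eta, y_parent in particles if lo <= eta <= hi]
    if not counted:
        return 0.0, 0.0
    outside = sum(1 for y in counted if y < lo or y > hi)
    far_outside = sum(1 for y in counted if y < lo - margin or y > hi + margin)
    return outside / len(counted), far_outside / len(counted)

# Made-up toy records, for illustration only.
toy = [(0.0, 0.1), (1.0, 1.4), (-1.1, -1.9), (2.0, 2.1), (0.5, 0.4)]
frac_out, frac_far = parent_leakage(toy, center=0.0, width=2.5)
print(frac_out, frac_far)
```

Running the same count once on all particles and once on those passing the $p_\perp$ cut gives the kind of before-and-after comparison quoted above.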

Rapidity gaps are traditionally defined by a total absence of particles
in a particular interval of pseudo-rapidity. We should not be single-minded
about this, however, since as shown above, neglecting particles with
$p_\perp < p_\perp^{\rm lim} \simeq 0.2$ GeV/c substantially improves the connection between the
pseudo-rapidities of the long-lived or stable particles that are measured and the true
rapidities of their parent hadrons.

* Attempts p4, E5| to measure the average charged multiplicity $dN_{\rm ch}/d\eta$ apparently
have ignored this pseudo-rapidity phenomenon when extrapolating to account for the
unmeasured portion of the spectrum at small $p_\perp$. They also use a phenomenological form
$dN/d\eta\,dp_\perp^2 \propto (p_\perp + p_0)^{-\alpha}$, which has incorrect analytic behavior in $p_\perp^2$;
$dN/dy\,dp_\perp^2 \propto (p_\perp^2 + p_0^2)^{-\alpha}$ would be preferable.

An additional motive for neglecting low-$p_\perp$ particles, aside from experimental
practicality, is the following: the theoretically significant variables
in rapidity gap physics are generally the invariant masses of the hadronic
systems to the left and right of the gap. These masses depend on the "+"
and "-" components of light-cone momentum, $\sqrt{p_\perp^2 + m^2}\, e^{\pm y}$, for which
particles with very low $p_\perp$ are less important.
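
For reference, the light-cone components and their relation to the invariant mass can be written out explicitly. This is a sketch under the common convention $p^\pm = E \pm p_z$ (some authors include a factor $1/\sqrt{2}$); the sum runs over the particles $i$ in the hadronic system on one side of the gap, and the symbol $M$ for its invariant mass is introduced here only for illustration.

```latex
p^{\pm} \equiv E \pm p_z = \sqrt{p_\perp^2 + m^2}\; e^{\pm y},
\qquad
M^2 = \Big(\sum_i p_i^{+}\Big)\Big(\sum_i p_i^{-}\Big)
      - \Big|\sum_i \vec{p}_{\perp i}\Big|^2 .
```

Each particle enters through its transverse mass $\sqrt{p_\perp^2 + m^2}$, which for a pion with $p_\perp < 0.2$ GeV/c is below about 0.25 GeV.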

5 Conclusion 

The ideal way to look for rapidity gaps would be to measure the multiplicity
zero component $P_0$ as a function of interval size $\Delta\eta$. The background from
non-gap physics should decline rapidly with increasing $\Delta\eta$; for example,
an exponential falloff of $P_0$ roughly describes the results of our simulation in the range
$\Delta\eta \approx 1$ to $3$. Any residual constant or slowly varying component at large $\Delta\eta$
is the rapidity gap signal.
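
The separation of signal from background described here can be sketched numerically. The example below assumes, purely for illustration, that the background is a single exponential $A e^{-k\,\Delta\eta}$ and the signal a constant $S$; the function name, the grid of trial slopes, and the synthetic input values are all hypothetical and are not results of the simulation.

```python
import math

def fit_exp_plus_const(x, p0, k_grid):
    """Least-squares fit of p0 ~ A*exp(-k*x) + S.

    For each trial slope k the model is linear in (A, S), so the 2x2
    normal equations are solved exactly; the k with the smallest summed
    squared residual is kept. Returns (A, k, S).
    """
    n = len(x)
    best = None
    for k in k_grid:
        e = [math.exp(-k * xi) for xi in x]
        s_ee = sum(ei * ei for ei in e)
        s_e = sum(e)
        s_ey = sum(ei * yi for ei, yi in zip(e, p0))
        s_y = sum(p0)
        det = s_ee * n - s_e * s_e
        if abs(det) < 1e-300:
            continue  # degenerate basis, skip this trial slope
        a = (s_ey * n - s_e * s_y) / det
        s = (s_ee * s_y - s_e * s_ey) / det
        resid = sum((a * ei + s - yi) ** 2 for ei, yi in zip(e, p0))
        if best is None or resid < best[0]:
            best = (resid, a, k, s)
    return best[1], best[2], best[3]

# Synthetic, noise-free input generated from assumed parameters.
A_TRUE, K_TRUE, S_TRUE = 0.5, 2.0, 0.01
intervals = [1.0 + 0.5 * i for i in range(9)]  # interval sizes 1.0 ... 5.0
p0_values = [A_TRUE * math.exp(-K_TRUE * d) + S_TRUE for d in intervals]
trial_slopes = [0.5 + 0.01 * i for i in range(400)]

A_fit, k_fit, S_fit = fit_exp_plus_const(intervals, p0_values, trial_slopes)
print(A_fit, k_fit, S_fit)
```

With real data the points would carry statistical errors and the fit should be weighted accordingly; the unweighted version is kept here only for brevity.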

Because current experiments are limited in interval size, and because it is
necessary to make experimental corrections based on measurements at non-zero
$n$, we have studied instead the form of the $P_n$ distribution in fixed regions
of $\Delta\eta$. One can hope that for $\Delta\eta \approx 3$, the rapidity gap signal will appear
as an anomalously large contribution to $P_0$ when compared to a smooth
parametrization that describes the rest of the distribution.

With the help of the QCD simulation program HERWIG, we have found
suitable ways to parametrize the $P_n$ distribution. A particularly convenient
choice is a sum of two NBD terms, which gives a very good fit, and automatically
provides a simple parametrization of the generating function $g(x)$ and
the Poisson density function $p(z)$. The smoothness of the parametrization
is demonstrated by the smooth single-peak form of $p(z)$ (Figs. 2, 4). The
absence of an anomalous contribution (or a gap signal) at very small $n$ can
be characterized by a power-law behavior $p(z) \sim {\rm const} \times z^a$ where $a > 0$, so
that $p(0) = 0$. Meanwhile, the often-used single NBD form has this property
and describes the distribution qualitatively (Fig. 3a), but is not good enough
at small $n$ (Fig. 3b) for measuring the background to a rapidity gap signal
unless the region of $n$ included in the fit is sharply restricted.
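
A two-term NBD parametrization and its Poisson density can be sketched in a few lines. The sketch assumes the standard NBD form with mean $\bar n$ and shape parameter $k$, and assumes that the Poisson density is defined by $P_n = \int_0^\infty p(z)\, e^{-z} z^n / n!\; dz$, in which case each NBD term corresponds to a gamma density with $p(z) \propto z^{k-1}$ at small $z$ (so $a = k - 1$). The conventions used earlier in the paper may differ in detail, and the parameter values below are arbitrary illustrations, not fit results.

```python
import math

def nbd(n, nbar, k):
    """Negative binomial P_n with mean nbar and shape parameter k."""
    log_p = (math.lgamma(n + k) - math.lgamma(k) - math.lgamma(n + 1)
             + n * math.log(nbar / (nbar + k))
             + k * math.log(k / (nbar + k)))
    return math.exp(log_p)

def gamma_density(z, nbar, k):
    """Poisson density of one NBD term (a gamma density); assumes k > 1."""
    if z <= 0.0:
        return 0.0  # p(0) = 0 holds for k > 1, the case treated here
    log_p = (k * math.log(k / nbar) + (k - 1) * math.log(z)
             - k * z / nbar - math.lgamma(k))
    return math.exp(log_p)

def two_nbd(n, w, nbar1, k1, nbar2, k2):
    """Weighted sum of two NBD terms with weights w and 1 - w."""
    return w * nbd(n, nbar1, k1) + (1.0 - w) * nbd(n, nbar2, k2)

def two_nbd_density(z, w, nbar1, k1, nbar2, k2):
    """Poisson density p(z) corresponding to two_nbd."""
    return (w * gamma_density(z, nbar1, k1)
            + (1.0 - w) * gamma_density(z, nbar2, k2))

def p0_from_density(density, z_max=60.0, steps=60000):
    """Trapezoid estimate of P_0 = integral of p(z) * exp(-z) dz."""
    h = z_max / steps
    total = 0.5 * (density(0.0) + density(z_max) * math.exp(-z_max))
    for i in range(1, steps):
        z = i * h
        total += density(z) * math.exp(-z)
    return total * h

# Arbitrary illustrative parameters (hypothetical, not fitted values).
PARAMS = dict(w=0.6, nbar1=8.0, k1=3.0, nbar2=20.0, k2=5.0)
p0_direct = two_nbd(0, **PARAMS)
p0_integral = p0_from_density(lambda z: two_nbd_density(z, **PARAMS))
print(p0_direct, p0_integral)
```

Because $e^{-z}$ weights the region near $z = 0$, a density that vanishes there gives a small $P_0$; an anomalous contribution at zero multiplicity would show up as extra weight in $p(z)$ at very small $z$.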

We find from the simulation that fits to $P_n$ for $n > 2$ can underestimate
$P_0$ by a factor ~2. This is a cautionary tale for rapidity gap studies, because
the excess $P_0$ has the same form as a rapidity gap signal. It will nevertheless
be possible to measure true gap effects in intervals as small as 2 to 3 units,
provided the $n = 0$ cross section turns out to be large compared to the
extrapolation from larger $n$. This will happen if the signal turns out to be on
the order of 1% or larger. Results from the experiments are eagerly awaited!

Further work is needed to create a phenomenological model to describe
the rapidity gap physics itself, which contributes not only to $n = 0$, but also
to other low multiplicities, since the gap in a given event can be just slightly
shorter than the rapidity interval under consideration, and since there are
edge effects associated with resonance decays as discussed in Sect. 4.

Figure Captions

1. Multiplicity distribution for particles with $p_\perp > 0.2$ GeV/c in a pseudo-rapidity
interval $\Delta\eta = 2.5$ centered between jets, with "soft background event"
turned off. Solid curve: good fit for $n > 1$; dashed curve: single NBD
fit.

2. Poisson density function representations $p(z)$ of the fits shown in Fig. 1.
Dotted curve: density function of fit to all data, including $n = 0$, in
Fig. 1.

3. (a) Multiplicity distribution similar to Fig. 1, with "soft background
event" included. (b) Expanded view of the distribution at small $n$.
Solid curve: good fit for $n > 2$; dashed curve: single NBD fit.

4. Poisson density function representations of the fits in Fig. 3.